Music service with motion video

ABSTRACT

Techniques of providing motion video content along with audio content are disclosed. In some example embodiments, a computer-implemented system is configured to perform operations comprising: receiving primary audio content; determining that at least one reference audio content satisfies a predetermined similarity threshold based on a comparison of the primary audio content with the at least one reference audio content; for each one of the at least one reference audio content, identifying motion video content based on the motion video content being stored in association with the one of the at least one reference audio content and not stored in association with the primary audio content; and causing the identified motion video content to be displayed on a device concurrently with a presentation of the primary audio content on the device.

TECHNICAL FIELD

The present application relates generally to the technical field of dataprocessing, and, in various embodiments, to methods and systems ofproviding motion video content along with audio content.

BACKGROUND

The presentation of audio content often lacks corresponding motion videocontent. Combining the presentation of motion video content with thepresentation such audio content presents many technical challenges,including, but not limited to, determining what motion video content touse and how to combine the motion video content with the audio content.

BRIEF DESCRIPTION OF THE DRAWINGS

Some embodiments of the present disclosure are illustrated by way ofexample and not limitation in the figures of the accompanying drawings,in which like reference numbers indicate similar elements.

FIG. 1 is a block diagram illustrating a network environment suitablefor providing motion video content along with audio content, inaccordance with some example embodiments.

FIG. 2 illustrates a comparison of primary audio content with aplurality of reference audio content, in accordance with some exampleembodiments.

FIGS. 3A-3B are conceptual diagrams illustrating the synchronizing ofmotion video content with primary audio content, in accordance with someexample embodiments.

FIGS. 4A-4B are conceptual diagrams illustrating the synchronizing ofdifferent motion video contents with primary audio content, inaccordance with some example embodiments.

FIG. 5 is a flowchart illustrating a method of providing motion videocontent along with audio content, in accordance with some exampleembodiments.

FIG. 6 is a flowchart illustrating a method of causing motion videocontent to be displayed on a device concurrently with the presentationof audio content on the device, in accordance with some exampleembodiments.

FIG. 7 is a block diagram illustrating a mobile device, in accordancewith some example embodiments.

FIG. 8 is a block diagram of an example computer system on whichmethodologies described herein can be executed, in accordance with someexample embodiments.

DETAILED DESCRIPTION

Example methods and systems of providing motion video content along withaudio content are disclosed. In the following description, for purposesof explanation, numerous specific details are set forth in order toprovide a thorough understanding of example embodiments. It will beevident, however, to one skilled in the art that the present embodimentsmay be practiced without these specific details.

In some example embodiments, a computer-implemented implemented methodcomprises: receiving primary audio content; determining that at leastone reference audio content satisfies a predetermined similaritythreshold based on a comparison of the primary audio content with the atleast one reference audio content; for each one of the at least onereference audio content, identifying motion video content based on themotion video content being stored in association with the one of the atleast one reference audio content and not stored in association with theprimary audio content; and causing the identified motion video contentto be displayed on a device concurrently with a presentation of theprimary audio content on the device. In some example embodiments, theprimary audio content comprises music.

In some example embodiments, the comparison comprises a comparison offingerprint data of the primary audio content with fingerprint data ofthe at least one reference audio content.

In some example embodiments, the identifying of the motion video contentis further based on a profile of a user associated with the device.

In some example embodiments, the causing the identified motion videocontent to be displayed on the device concurrently with the presentationof the primary audio content on the device comprises: synchronizing dataof the at least one reference audio content with data of the primaryaudio content; and synchronizing the identified motion video contentwith the primary audio content based on the synchronizing of the data ofthe at least one reference audio content with the data of the primaryaudio content. In some example embodiments, the synchronizing of thedata of the at least one reference audio content with the data of theprimary audio content is based on a comparison of fingerprint data ofthe at least one reference audio content with fingerprint data of theprimary audio content.

In some example embodiments, the at least one reference audio contentcomprises at least two reference audio content, each one of the at leasttwo reference audio content being stored in association with differentmotion video content, the identified motion video content comprising aportion of each of the different motion video content.

The methods or embodiments disclosed herein can be implemented as acomputer system having one or more modules (e.g., hardware modules orsoftware modules). Such modules can be executed by one or moreprocessors of the computer system. The methods or embodiments disclosedherein can be embodied as instructions stored on a machine-readablemedium that, when executed by one or more processors, cause the one ormore processors to perform the instructions.

FIG. 1 is a block diagram illustrating a network environment 100suitable for providing motion video content along with audio content, inaccordance with some example embodiments. The network environment 100includes a content provider 110, one or more devices 130, and one ormore data sources 140 (e.g., data source 140-1 through data source140-N), all of which are communicatively coupled to each other via anetwork 120. The content provider 110, the device(s) 130, and the datasource(s) 140 may each be implemented in a computer system, in whole orin part, as described below with respect to FIG. 8.

Also shown in FIG. 1 is a user 132. The user 132 may be a human user(e.g., a human being), a machine user (e.g., a computer configured by asoftware program to interact with the device 130), or any suitablecombination thereof (e.g., a human assisted by a machine or a machinesupervised by a human). The user 132 is not part of the networkenvironment 100, but is associated with the device 130 and may be a userof the device 130. For example, the device 130 may be a desktopcomputer, a vehicle computer, a tablet computer, a navigational device,a portable media device, or a smart phone belonging to the user 132.

Any of the machines, providers, modules, databases, devices, or datasources shown in FIG. 1 may be implemented in a computer modified (e.g.,configured or programmed) by software to be a special-purpose computerto perform one or more of the functions described herein for thatmachine, provider, module, database, device, or data source. Forexample, a computer system able to implement any one or more of themethodologies described herein is discussed below with respect to FIG.8. As used herein, a “database” is a data storage resource and may storedata structured as a text file, a table, a spreadsheet, a relationaldatabase (e.g., an object-relational database), a triple store, ahierarchical data store, or any suitable combination thereof. Moreover,any two or more of the databases, devices, or data sources illustratedin FIG. 1 may be combined into a single machine, and the functionsdescribed herein for any single database, device, or data source may besubdivided among multiple databases, devices, or data sources.

The network 120 may be any network that enables communication between oramong machines, databases, and devices. Accordingly, the network 120 maybe a wired network, a wireless network (e.g., a mobile or cellularnetwork), or any suitable combination thereof. The network 120 mayinclude one or more portions that constitute a private network, a publicnetwork (e.g., the Internet), or any suitable combination thereof.Accordingly, the network 120 may include one or more portions thatincorporate a local area network (LAN), a wide area network (WAN), theInternet, a mobile telephone network (e.g., a cellular network), a wiredtelephone network (e.g., a plain old telephone system (POTS) network), awireless data network (e.g., WiFi network or WiMax network), or anysuitable combination thereof. Any one or more portions of the network190 may communicate information via a transmission medium. As usedherein, “transmission medium” shall be taken to include any intangiblemedium that is capable of storing, encoding, or carrying instructionsfor execution by a machine, and includes digital or analog communicationsignals or other intangible media to facilitate communication of suchsoftware.

The content provider 110 comprises a computer system configured toprovide audio content and video content to devices, such as device 130.In some example embodiments, the content provider 110 comprises anycombination of one or more of a reference determination module 112, avideo identification module 114, a presentation module 116, and one ormore databases 118. In some example embodiments, the modules 112, 114,and 116, and the database(s) 118 reside on a machine having a memory andat least one processor. In some example embodiments, the modules 112,114, and 116, and the database(s) 118 reside on the same machine, whilein other example embodiments, one or more of modules 112, 114, and 116and database(s) 118 reside on separate remote machines that communicatewith each other via a network, such as network 120.

In some example embodiments, the reference determination module 112 isconfigured to receive primary audio content. The audio content maycomprise music, such as a single song recording. However, it iscontemplated that other types of audio content are also within the scopeof the present disclosure. In some example embodiments, the referencedetermination module 112 is configured to identify, or otherwisedetermine, at least one reference audio content that satisfies apredetermined similarity threshold based on a comparison of the primaryaudio content with the reference audio content. For example, thereference determination module 112 may search a plurality of referenceaudio content stored on database(s) 118 for reference audio content thatsatisfies the predetermined similarity threshold. Additionally, oralternatively, the reference determination module 112 may search one ormore external data sources 140 for reference audio content thatsatisfies the predetermined similarity threshold. The external datasources 140 may comprise data sources that are separate and independentfrom the content provider 110.

In some example embodiments, the comparison of the primary audio contentwith the reference audio content comprises a comparison of data of theprimary audio content with data of the reference audio content. The databeing compared may comprise fingerprint data that uniquely identifies orcharacterizes the corresponding audio content. FIG. 2 illustrates acomparison of primary audio content with a plurality of reference audiocontent, in accordance with some example embodiments. In FIG. 2,fingerprint data 212 of primary audio content 210 is compared with aplurality of fingerprint data 222 (e.g., fingerprint data 222-1, . . . ,fingerprint data 222-N) of a plurality of reference audio content 220(e.g., reference audio content 220-1, . . . , reference audio content220-N). In some example embodiments, each comparison generatescorresponding statistical data that indicates a level of similaritybetween the primary audio content and the reference audio content. Oneexample of such statistical data is a bit error rate. However, it iscontemplated that other statistical data is also within the scope of thepresent disclosure. In some example embodiments, the referencedetermination module 112 determines if the statistical datacorresponding to a reference audio content 220 satisfies a predeterminedthreshold.

In some example embodiments, the reference determination module 112 isconfigured to use an exact fingerprint match between the fingerprintdata 212 of the primary audio content 210 and the fingerprint data 222of the reference audio content 220 as the predetermined threshold. Forexample, the reference determination module 112 may search through theplurality of reference audio content 220 to match one version of anaudio recording (e.g., compressed or noisy) to a version of the sameaudio recording that is not degraded.

In some example embodiments, the reference determination module 112 isconfigured to use a fuzzy fingerprint match between the fingerprint data212 of the primary audio content 210 and the fingerprint data 222 of thereference audio content 220 as the predetermined threshold. For example,the reference determination module 112 may search through the pluralityof reference audio content 220 and match a recording of a song (or atheatrical performance, narration, or the like) to a differentperformance or recording of the same song (or theatrical performance,narration, or the like).

In some example embodiments, the reference determination module 112 isconfigured to use a match between audio characteristics, such as chords,rhythmic features, instrument onsets, and the like, of the primary audiocontent 210 and such audio characteristics of the reference audiocontent 220 as the predetermined threshold. For example, the referencedetermination module 112 may search through the plurality of referenceaudio content 220 and match one audio recording with another audiorecording simply based on particular level of similarity between theaudio characteristics of the different audio recordings, such asmatching two different songs based on a determination of a high level ofsimilarity between the rhythmic features of the two different songs.

In some example embodiments, for one or more of the reference audiocontent 220 that is determined to satisfy the similarity threshold, thevideo identification module 114 identifies motion video content based onthe motion video content being stored in association with the referenceaudio content and not stored in association with the primary audiocontent. In some example embodiments, the video identification module114 is configured to identify the motion video content also based on aprofile of a user 132 associated with the device 130 to which thecombination of the primary audio content and the identified motion videocontent will be presented. In some example embodiments, user profilesare stored in databases) 118. The profile of the user 132 may compriseany combination of one or more of a history of audio content listened toby the user 132, indications that the user 132 likes certain types orcategories of audio content, purchase history of audio content, ahistory of motion video content viewed by the user 132, indications thatthe user 132 likes certain types or categories of motion video, anddemographic information about the user 132 (e.g., gender, age,geographic location). Other types of information indicating potentialpreferences for certain types of audio content or certain types of videocontent may also be included in the profile of the user 132. Inscenarios where several different motion video contents satisfy thesimilarity threshold, the video identification module 114 may use theprofile of the user 132 to select one or more motion video contentsbased on a determination of what motion video content is most relevantto the user 132.

In some example embodiments, the presentation module 116 is configuredto cause the motion video content identified by the video identificationmodule 114 to be displayed on the device 130 concurrently with apresentation of the primary audio content on the device 132. In someexample embodiments where the primary audio content comprises a song,the content provider 110 has thus dynamically created a music video fora song for which the content provider 110 was storing a music video.

In some example embodiments, the presentation module 116 is configuredto synchronize data of the reference audio content with data of theprimary audio content, and then synchronize the identified motion videocontent with the primary audio content based on the synchronizing of thedata of the reference audio content with the data of the primary audiocontent. In some example embodiments, the synchronizing of the data ofthe reference audio content with the data of the primary audio contentis based on a comparison of fingerprint data of the reference audiocontent with fingerprint data of the primary audio content.

FIGS. 3A-3B are conceptual diagrams illustrating the synchronizing ofmotion video content with primary audio content, in accordance with someexample embodiments. In FIG. 3A, a primary audio content 210 is shown asbeing made up of audio segments 310-1, 310-2, 310-3, and 310-4, andreference audio content 220 is shown as being made up of audio segments320-1, 320-2, 320-3, and 320-4. Reference audio content 220 is alsoshown as being stored in association with motion video content 320,which is shown as being made up of video segments 322-1, 322-2, 322-3,and 322-4. It is contemplated that other segmentation configurations arealso within the scope of the present disclosure. In FIG. 3A, the audiosegments 310 of the primary audio content 210 and the audio segments 320of the reference audio content 220 are aligned along a time domain inaccordance with their respective timestamps as a result of thepresentation module 116 synchronizing them. Similarly, the videosegments 322 of the motion video content 320 are aligned with the audiosegments 320 of the reference audio content 220 with which it isassociated.

In FIG. 3B, using the synchronization of the audio segments of the 320of the reference audio content 220 with the audio segments 310 of theprimary audio content 210, as well as the synchronization of the videosegments 322 of the motion video content 320 with the audio segments 320of the reference audio content 220, the presentation module 116synchronizes the video segments 322 of the motion video content 320 withthe audio segments 310 of the primary audio content 210.

In some example embodiments, portions from a plurality of differentmotion video contents associated with a plurality of different referenceaudio contents are combined with the primary audio content. FIGS. 4A-4Bare conceptual diagrams illustrating the synchronizing of differentmotion video contents with primary audio content, in accordance withsome example embodiments. In FIG. 4A, similar to FIG. 3A, a primaryaudio content 210 is shown as being made up of audio segments 310-1,310-2, 310-3, and 310-4, and reference audio content 220 is shown asbeing made up of audio segments 320-1, 320-2, 320-3, and 320-4.Reference audio content 220 is also shown as being stored in associationwith motion video content 320, which is shown as being made up of videosegments 322-1, 322-3, and 322-4. The audio segments 310 of the primaryaudio content 210 and the audio segments 320 of the reference audiocontent 220 are aligned along a time domain in accordance with theirrespective timestamps as a result of the presentation module 116synchronizing them. Similarly, the video segments 322 of the motionvideo content 320 are aligned with the audio segments 320 of thereference audio content 220 with which it is associated.

Additionally, in FIG. 4A, another reference audio content 420 is shownas being made up of audio segments 420-1, 420-2, 420-3, and 420-4.Reference audio content 420 is also shown as being stored in associationwith motion video content 420, which is shown as being made up of videosegments 422-1, 422-2, 422-3, and 422-4. The audio segments 420-1,420-2, 420-3, and 420-4 of the reference audio content 420 and the videosegments 422-1, 422-2, 422-3, and 422-4 are aligned with the audiosegments 310-1, 310-2, 310-3, and 310-4 of the primary audio content210.

Using the synchronizations, the presentation module 116 generates amotion video content 425 from portions of the motion video content 320and portions of the motion video content 420. As a result, video segment322-1 is synchronized with audio segment 310-1, video segment 322-2 issynchronized with audio segment 310-2, video segment 422-3 issynchronized with audio segment 310-3, and video segment 422-4 issynchronized with audio segment 310-4.

In some example embodiments, the presentation module 116 is configuredto synchronize the audio segments 310 of the primary audio content 210with the audio segments 320 of the reference audio content 220 based onan exact fingerprint match between the audio segments 310 of the primaryaudio content 210 and the audio segments 320 of the reference audiocontent 220. For example, the presentation module 116 may synchronizethe audio segments 310 of the primary audio content 210 with the audiosegments 320 of the reference audio content 220 based on a match betweenone version of an audio recording (e.g., compressed or noisy) and aversion of the same audio recording that is not degraded.

In some example embodiments, the presentation module 116 is configuredto synchronize the audio segments 310 of the primary audio content 210with the audio segments 320 of the reference audio content 220 based ona fuzzy fingerprint match between the audio segments 310 of the primaryaudio content 210 and the audio segments 320 of the reference audiocontent 220. For example, the presentation module 116 may synchronizethe audio segments 310 of the primary audio content 210 with the audiosegments 320 of the reference audio content 220 based on a match betweena recording of a song (or a theatrical performance, narration, or thelike) and a different performance or recording of the same song (ortheatrical performance, narration, or the like).

In some example embodiments, the presentation module 116 is configuredto synchronize the audio segments 310 of the primary audio content 210with the audio segments 320 of the reference audio content 220 using amatch between audio characteristics, such as chords, rhythmic features,instrument onsets, and the like, of the audio segments 310 of theprimary audio content 210 with such audio characteristics of the audiosegments 320 of the reference audio content 220. For example, thepresentation module 116 may synchronize the audio segments 310 of theprimary audio content 210 with the audio segments 320 of the referenceaudio content 220 based on particular level of similarity between theaudio characteristics of different audio recordings, such assynchronizing two different songs based on a determination of a highlevel of similarity between the rhythmic features of the two differentsongs.

In some example embodiments, the video identification module 114 and thepresentation module 116 are configured to identify different motionvideo content to synchronize and display concurrently with the sameprimary audio content, thereby changing the video experience for thesame primary audio content from one playback to the next. The change inthe video experience from one presentation of the primary audio contentto the next may be partial, such as by swapping out one video segment orone scene for another video segment or another scene, while maintainingat least one video segment or one scene from one presentation to thenext. Alternatively, the change in the video experience from onepresentation of the primary audio content to the next may be complete,such as by replacing all of the video segments used for a presentationof the primary audio content with completely different video segmentsfor a subsequent presentation of the primary audio content. For example,on one day, video content covering an entire live performance of a songmay be synchronized and displayed concurrently with primary audiocontent, and then on a subsequent day, video content covering an entireperformance of the same song in a studio (e.g., different from the liveperformance) may be synchronized and displayed concurrently with thesame primary audio content instead of the video content covering theentire live performance of the song. Such changing of the videoexperience may be based on detected changes in popularity of the videocontent (e.g., changes in the total number of YouTube views of videocontent from day to day), or may be based on detected changes inpreferences or behavior of the user who is to be presented with thevideo content along with the primary audio content (e.g., changes in theuser's viewing habits of video content on YouTube), or may be random. Itis contemplated that other factors may be used to cause the changing ofvideo content from one presentation to another.

FIG. 5 is a flowchart illustrating a method 500 of providing motionvideo content along with audio content, in accordance with some exampleembodiments. Method 500 can be performed by processing logic that cancomprise hardware (e.g., circuitry, dedicated logic, programmable logic,microcode, etc.), software (e.g., instructions run on a processingdevice), or a combination thereof. In one example embodiment, the method500 is performed by the content provider 110 of FIG. 1, or anycombination of one or more of its components or modules.

At operation 510, the content provider 110 receives primary audiocontent. In some example embodiments, the primary audio contentcomprises music (e.g., a song). At operation 520, the content provider110 determines that at least one reference audio content satisfies apredetermined similarity threshold based on a comparison of the primaryaudio content with the at least one reference audio content. In someexample embodiments, the comparison comprises a comparison offingerprint data of the primary audio content with fingerprint data ofthe at least one reference audio content. At operation 530, for each oneof the at least one reference audio content, the content provider 110identifies motion video content based on the motion video content beingstored in association with the one of the at least one reference audiocontent and not stored in association with the primary audio content. Insome example embodiments, the identifying of the motion video content isfurther based on a profile of a user associated with the device. Atoperation 540, the content provider 110 causes the identified motionvideo content to be displayed on a device concurrently with apresentation of the primary audio content on the device. It iscontemplated that any of the other features described within the presentdisclosure can be incorporated into method 500.

FIG. 6 is a flowchart illustrating a method 600 of causing motion videocontent to be displayed on a device concurrently with the presentationof audio content on the device, in accordance with some exampleembodiments. Method 600 can be performed by processing logic that cancomprise hardware (e.g., circuitry, dedicated logic, programmable logic,microcode, etc.), software (e.g., instructions run on a processingdevice), or a combination thereof. In one example embodiment, the method600 is performed by the content provider 110 of FIG. 1, or anycombination of one or more of its components or modules.

At operation 610, the content provider synchronizes data of at least onereference audio content with data of primary audio content. At operation620, the content provider 110 synchronizes identified motion videocontent with the primary audio content based on the synchronizing of thedata of the at least one reference audio content with the data of theprimary audio content. In some example embodiments, the synchronizing ofthe data of the at least one reference audio content with the data ofthe primary audio content is based on a comparison of fingerprint dataof the at least one reference audio content with fingerprint data of theprimary audio content. It is contemplated that any of the other featuresdescribed within the present disclosure can be incorporated into method600.

Example Mobile Device

FIG. 7 is a block diagram illustrating a mobile device 700, according toan example embodiment. The mobile device 700 can include a processor702. The processor 702 can be any of a variety of different types ofcommercially available processors suitable for mobile devices 700 (forexample, an XScale architecture microprocessor, a Microprocessor withoutInterlocked Pipeline Stages (MIPS) architecture processor, or anothertype of processor). A memory 704, such as a random access memory (RAM),a Flash memory, or other type of memory, is typically accessible to theprocessor 702. The memory 704 can be adapted to store an operatingsystem (OS) 706, as well as application programs 708, such as a mobilelocation enabled application that can provide LBSs to a user. Theprocessor 702 can be coupled, either directly or via appropriateintermediary hardware, to a display 710 and to one or more input/output(I/O) devices 712, such as a keypad, a touch panel sensor, a microphone,and the like. Similarly, in some embodiments, the processor 702 can becoupled to a transceiver 714 that interfaces with an antenna 716. Thetransceiver 714 can be configured to both transmit and receive cellularnetwork signals, wireless data signals, or other types of signals viathe antenna 716, depending on the nature of the mobile device 700.Further, in some configurations, a GPS receiver 718 can also make use ofthe antenna 716 to receive GPS signals.

Modules, Components and Logic

Certain embodiments are described herein as including logic or a numberof components, modules, or mechanisms. Modules can constitute eithersoftware modules (e.g., code embodied on a machine-readable medium or ina transmission signal) or hardware modules. A hardware module is atangible unit capable of performing certain operations and can beconfigured or arranged in a certain manner. In example embodiments, oneor more computer systems (e.g., a standalone, client, or server computersystem) or one or more hardware modules of a computer system (e.g., aprocessor or a group of processors) can be configured by software (e.g.,an application or application portion) as a hardware module thatoperates to perform certain operations as described herein.

In various embodiments, a hardware module can be implementedmechanically or electronically. For example, a hardware module cancomprise dedicated circuitry or logic that is permanently configured(e.g., as a special-purpose processor, such as a field-programmable gatearray (FPGA) or an application-specific integrated circuit (ASIC)) toperform certain operations. A hardware module can also compriseprogrammable logic or circuitry (e.g., as encompassed within ageneral-purpose processor or other programmable processor) that istemporarily configured by software to perform certain operations. Itwill be appreciated that the decision to implement a hardware modulemechanically, in dedicated and permanently configured circuitry, or intemporarily configured circuitry (e.g., configured by software) can bedriven by cost and time considerations.

Accordingly, the term “hardware module” should be understood toencompass a tangible entity, be that an entity that is physicallyconstructed, permanently configured (e.g., hardwired) or temporarilyconfigured (e.g., programmed) to operate in a certain manner and/or toperform certain operations described herein. Considering embodiments inwhich hardware modules are temporarily configured (e.g., programmed),each of the hardware modules need not be configured or instantiated atany one instance in time. For example, where the hardware modulescomprise a general-purpose processor configured using software, thegeneral-purpose processor can be configured as respective differenthardware modules at different times. Software can accordingly configurea processor, for example, to constitute a particular hardware module atone instance of time and to constitute a different hardware module at adifferent instance of time.

Hardware modules can provide information to, and receive informationfrom, other hardware modules. Accordingly, the described hardwaremodules can be regarded as being communicatively coupled. Where multipleof such hardware modules exist contemporaneously, communications can beachieved through signal transmission (e.g., over appropriate circuitsand buses) that connect the hardware modules. In embodiments in whichmultiple hardware modules are configured or instantiated at differenttimes, communications between such hardware modules can be achieved, forexample, through the storage and retrieval of information in memorystructures to which the multiple hardware modules have access. Forexample, one hardware module can perform an operation and store theoutput of that operation in a memory device to which it iscommunicatively coupled. A further hardware module can then, at a latertime, access the memory device to retrieve and process the storedoutput. Hardware modules can also initiate communications with input oroutput devices and can operate on a resource (e.g., a collection ofinformation).

The various operations of example methods described herein can beperformed, at least partially, by one or more processors that aretemporarily configured (e.g., by software) or permanently configured toperform the relevant operations. Whether temporarily or permanentlyconfigured, such processors can constitute processor-implemented modulesthat operate to perform one or more operations or functions. The modulesreferred to herein can, in some example embodiments, compriseprocessor-implemented modules.

Similarly, the methods described herein can be at least partiallyprocessor-implemented. For example, at least some of the operations of amethod can be performed by one or more processors orprocessor-implemented modules. The performance of certain of theoperations can be distributed among the one or more processors, not onlyresiding within a single machine, but deployed across a number ofmachines. In some example embodiments, the processor or processors canbe located in a single location (e.g., within a home environment, anoffice environment or as a server farm), while in other embodiments theprocessors can be distributed across a number of locations.

The one or more processors can also operate to support performance ofthe relevant operations in a “cloud computing” environment or as a“software as a service” (SaaS). For example, at least some of theoperations can be performed by a group of computers (as examples ofmachines including processors), these operations being accessible via anetwork and via one or more appropriate interfaces (e.g., APIs).

Example embodiments can be implemented in digital electronic circuitry,or in computer hardware, firmware, software, or in combinations of them.Example embodiments can be implemented using a computer program product,e.g., a computer program tangibly embodied in an information carrier,e.g., in a machine-readable medium for execution by, or to control theoperation of, data processing apparatus, e.g., a programmable processor,a computer, or multiple computers.

A computer program can be written in any form of programming language,including compiled or interpreted languages, and it can be deployed inany form, including as a stand-alone program or as a module, subroutine,or other unit suitable for use in a computing environment. A computerprogram can be deployed to be executed on one computer or on multiplecomputers at one site or distributed across multiple sites andinterconnected by a communication network.

In example embodiments, operations can be performed by one or moreprogrammable processors executing a computer program to performfunctions by operating on input data and generating output. Methodoperations can also be performed by, and apparatus of exampleembodiments can be implemented as, special purpose logic circuitry(e.g., a FPGA or an ASIC).

A computing system can include clients and servers. A client and serverare generally remote from each other and typically interact through acommunication network. The relationship of client and server arises byvirtue of computer programs running on the respective computers andhaving a client-server relationship to each other. In embodimentsdeploying a programmable computing system, it will be appreciated thatboth hardware and software architectures merit consideration.Specifically, it will be appreciated that the choice of whether toimplement certain functionality in permanently configured hardware(e.g., an ASIC), in temporarily configured hardware (e.g., a combinationof software and a programmable processor), or a combination ofpermanently and temporarily configured hardware can be a design choice.Below are set out hardware (e.g., machine) and software architecturesthat can be deployed, in various example embodiments.

FIG. 8 is a block diagram of a machine in the example form of a computersystem 800 within which instructions 824 for causing the machine toperform any one or more of the methodologies discussed herein can beexecuted, in accordance with an example embodiment. In alternativeembodiments, the machine operates as a standalone device or can beconnected (e.g., networked) to other machines. In a networkeddeployment, the machine can operate in the capacity of a server or aclient machine in a server-client network environment, or as a peermachine in a peer-to-peer (or distributed) network environment. Themachine can be a personal computer (PC), a tablet PC, a set-top box(STB), a Personal Digital Assistant (PDA), a cellular telephone, a webappliance, a network router, switch or bridge, or any machine capable ofexecuting instructions (sequential or otherwise) that specify actions tobe taken by that machine. Further, while only a single machine isillustrated, the term “machine” shall also be taken to include anycollection of machines that individually or jointly execute a set (ormultiple sets) of instructions to perform any one or more of themethodologies discussed herein.

The example computer system 800 includes a processor 802 (e.g., acentral processing unit (CPU), a graphics processing unit (GPU) orboth), a main memory 804 and a static memory 806, which communicate witheach other via a bus 808. The computer system 800 can further include avideo display unit 810 (e.g., a liquid crystal display (LCD) or acathode ray tube (CRT)). The computer system 800 also includes analphanumeric input device 812 (e.g., a keyboard), a user interface (UI)navigation (or cursor control) device 814 (e.g., a mouse), a disk driveunit 816, a signal generation device 818 (e.g., a speaker) and a networkinterface device 820.

The disk drive unit 816 includes a machine-readable medium 822 on whichis stored one or more sets of data structures and instructions 824(e.g., software) embodying or utilized by any one or more of themethodologies or functions described herein. The instructions 824 canalso reside, completely or at least partially, within the main memory804 and/or within the processor 802 during execution thereof by thecomputer system 800, the main memory 804 and the processor 802 alsoconstituting machine-readable media. The instructions 824 can alsoreside, completely or at least partially, within the static memory 806.

While the machine-readable medium 822 is shown in an example embodimentto be a single medium, the term “machine-readable medium” can include asingle medium or multiple media (e.g., a centralized or distributeddatabase, and/or associated caches and servers) that store the one ormore instructions 824 or data structures. The term “machine-readablemedium” shall also be taken to include any tangible medium that iscapable of storing, encoding or carrying instructions for execution bythe machine and that cause the machine to perform any one or more of themethodologies of the present embodiments, or that is capable of storing,encoding or carrying data structures utilized by or associated with suchinstructions. The term “machine-readable medium” shall accordingly betaken to include, but not be limited to, solid-state memories, andoptical and magnetic media. Specific examples of machine-readable mediainclude non-volatile memory, including by way of example semiconductormemory devices (e.g., Erasable Programmable Read-Only Memory (EPROM),Electrically Erasable Programmable Read-Only Memory (EEPROM), and flashmemory devices), magnetic disks such as internal hard disks andremovable disks; magneto-optical disks; and compact disc-read-onlymemory (CD-ROM) and digital versatile disc (or digital video disc)read-only memory (DVD-ROM) disks.

The instructions 824 can further be transmitted or received over acommunications network 826 using a transmission medium. The instructions824 can be transmitted using the network interface device 820 and anyone of a number of well-known transfer protocols (e.g., HTTP). Examplesof communication networks include a LAN, a WAN, the Internet, mobiletelephone networks, POTS networks, and wireless data networks (e.g.,WiFi and WiMAX networks). The term “transmission medium” shall be takento include any intangible medium capable of storing, encoding, orcarrying instructions for execution by the machine, and includes digitalor analog communications signals or other intangible media to facilitatecommunication of such software.

Although an embodiment has been described with reference to specificexample embodiments, it will be evident that various modifications andchanges can be made to these embodiments without departing from thebroader spirit and scope of the present disclosure. Accordingly, thespecification and drawings are to be regarded in an illustrative ratherthan a restrictive sense. The accompanying drawings that form a parthereof, show by way of illustration, and not of limitation, specificembodiments in which the subject matter can be practiced. Theembodiments illustrated are described in sufficient detail to enablethose skilled in the art to practice the teachings disclosed herein.Other embodiments can be utilized and derived therefrom, such thatstructural and logical substitutions and changes can be made withoutdeparting from the scope of this disclosure. This Detailed Description,therefore, is not to be taken in a limiting sense, and the scope ofvarious embodiments is defined only by the appended claims, along withthe full range of equivalents to which such claims are entitled.

Although specific embodiments have been illustrated and describedherein, it should be appreciated that any arrangement calculated toachieve the same purpose can be substituted for the specific embodimentsshown. This disclosure is intended to cover any and all adaptations orvariations of various embodiments. Combinations of the aboveembodiments, and other embodiments not specifically described herein,will be apparent to those of skill in the art upon reviewing the abovedescription.

What is claimed is:
 1. A computer-implemented method comprising:receiving primary audio content; determining, by at least one hardwareprocessor, that at least one reference audio content satisfies apredetermined similarity threshold based on a comparison of the primaryaudio content with the at least one reference audio content; for eachone of the at least one reference audio content, identifying motionvideo content based on the motion video content being stored inassociation with the one of the at least one reference audio content andnot stored in association with the primary audio content; and causingthe identified motion video content to be displayed on a deviceconcurrently with a presentation of the primary audio content on thedevice by (i) synchronizing data of the at least one reference audiocontent with data of the primary audio content and (ii) synchronizingthe identified motion video content with the primary audio content basedon the synchronizing of the data of the at least one reference audiocontent with the data of the primary audio content.
 2. Thecomputer-implemented method of claim 1, wherein the primary audiocontent comprises music.
 3. The computer-implemented method of claim 1,wherein the comparison comprises a comparison of fingerprint data of theprimary audio content with fingerprint data of the at least onereference audio content.
 4. The computer-implemented method of claim 1,wherein the identifying of the motion video content is further based ona profile of a user associated with the device.
 5. Thecomputer-implemented method of claim 1, wherein the synchronizing of thedata of the at least one reference audio content with the data of theprimary audio content is based on a comparison of fingerprint data ofthe at least one reference audio content with fingerprint data of theprimary audio content.
 6. The computer-implemented method of claim 1,wherein the at least one reference audio content comprises at least tworeference audio content, each one of the at least two reference audiocontent being stored in association with different motion video content,the identified motion video content comprising a portion of each of thedifferent motion video content.
 7. The computer-implemented method ofclaim 1, wherein the comparison comprises a comparison of a first audiocharacteristic of the primary audio content with a second audiocharacteristic of the at least one reference audio content, wherein thefirst and second audio characteristics are selected from the groupconsisting of: chords, rhythmic features, and instrument onsets.
 8. Asystem comprising: at least one processor; and a non-transitorycomputer-readable medium storing executable instructions that, whenexecuted, cause the at least one processor to perform operationscomprising: receiving primary audio content; determining that at leastone reference audio content satisfies a predetermined similaritythreshold based on a comparison of the primary audio content with the atleast one reference audio content; for each one of the at least onereference audio content, identifying motion video content based on themotion video content being stored in association with the one of the atleast one reference audio content and not stored in association with theprimary audio content; and causing the identified motion video contentto be displayed on a device concurrently with a presentation of theprimary audio content on the device by (i) synchronizing data of the atleast one reference audio content with data of the primary audio contentand (ii) synchronizing the identified motion video content with theprimary audio content based on the synchronizing of the data of the atleast one reference audio content with the data of the primary audiocontent.
 9. The system of claim 8, wherein the primary audio contentcomprises music.
 10. The system of claim 8, wherein the comparisoncomprises a comparison of fingerprint data of the primary audio contentwith fingerprint data of the at least one reference audio content. 11.The system of claim 8, wherein the identifying of the motion videocontent is further based on a profile of a user associated with thedevice.
 12. The system of claim 8, wherein the synchronizing of the dataof the at least one reference audio content with the data of the primaryaudio content is based on a comparison of fingerprint data of the atleast one reference audio content with fingerprint data of the primaryaudio content.
 13. The system of claim 8, wherein the at least onereference audio content comprises at least two reference audio content,each one of the at least two reference audio content being stored inassociation with different motion video content, the identified motionvideo content comprising a portion of each of the different motion videocontent.
 14. The system of claim 8, wherein the comparison comprises acomparison of a first audio characteristic of the primary audio contentwith a second audio characteristic of the at least one reference audiocontent, wherein the first and second audio characteristics are selectedfrom the group consisting of: chords, rhythmic features, and instrumentonsets.
 15. A non-transitory machine-readable storage medium, tangiblyembodying a set of instructions that, when executed by at least oneprocessor, causes the at least one processor to perform operationscomprising: receiving primary audio content; determining, by at leastone hardware processor, that at least one reference audio contentsatisfies a predetermined similarity threshold based on a comparison ofthe primary audio content with the at least one reference audio content;for each one of the at least one reference audio content, identifyingmotion video content based on the motion video content being stored inassociation with the one of the at least one reference audio content andnot stored in association with the primary audio content; and causingthe identified motion video content to be displayed on a deviceconcurrently with a presentation of the primary audio content on thedevice by (i) synchronizing data of the at least one reference audiocontent with data of the primary audio content and (ii) synchronizingthe identified motion video content with the primary audio content basedon the synchronizing of the data of the at least one reference audiocontent with the data of the primary audio content.
 16. Thenon-transitory machine-readable storage medium of claim 15, wherein thecomparison comprises a comparison of fingerprint data of the primaryaudio content with fingerprint data of the at least one reference audiocontent.
 17. The non-transitory machine-readable storage medium of claim15, wherein the identifying of the motion video content is further basedon a profile of a user associated with the device.
 18. Thenon-transitory machine-readable storage medium of claim 15, wherein thesynchronizing of the data of the at least one reference audio contentwith the data of the primary audio content is based on a comparison offingerprint data of the at least one reference audio content withfingerprint data of the primary audio content.
 19. The non-transitorymachine-readable storage medium of claim 15, wherein the at least onereference audio content comprises at least two reference audio content,each one of the at least two reference audio content being stored inassociation with different motion video content, the identified motionvideo content comprising a portion of each of the different motion videocontent.
 20. The non-transitory machine-readable storage medium of claim15, wherein the comparison comprises a comparison of a first audiocharacteristic of the primary audio content with a second audiocharacteristic of the at least one reference audio content, wherein thefirst and second audio characteristics are selected from the groupconsisting of: chords, rhythmic features, and instrument onsets.